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ABSTRACT 

Polyadenylation of eukaryotic mRNAs contributes to 
stability, transport and translation, and is catalyzed 
by a large complex of conserved proteins. The Pcf11 
subunit of the yeast CF IA factor functions as a 
scaffold for the processing machinery during the 
termination and polyadenylation of transcripts. 
Its partner, Clp1, is needed for mRNA processing, 
but its precise molecular role has remained enig- 
matic. We show that Clp1 interacts with the 
Cleavage-Polyadenylation Factor (CPF) through its 
N-terminal and central domains, and thus provides 
cross-factor connections within the processing 
complex. Clp1 is known to bind ATP, consistent 
with the reported RNA kinase activity of human 
Clp1. However, substitution of conserved amino 
acids in the ATP-binding site did not affect cell 
growth, suggesting that the essential function of 
yeast Clp1 does not involve ATP hydrolysis. 
Surprisingly, non-viable mutations predicted to 
displace ATP did not affect ATP binding but dis- 
turbed the Clp1-Pcf11 interaction. In support of 
the importance of this interaction, a mutation in 
Pcf11 that disrupts the Clp1 contact caused 
defects in growth, 3-end processing and tran- 
scription termination. These results define Clp1 
as a bridge between CF IA and CPF and indicate 
that the Clp1-Pcf11 interaction is modulated by 
amino acids in the conserved ATP-binding site 
of Clp1. 



INTRODUCTION 

Poly(A) tails are added post-transcriptionally to nuclear 
pre-mRNA Spends in eukaryotic cells in a two-step 
reaction involving cleavage in the 3'-untranslated region 
and extension of the new 3'-end by poly(A) synthesis (1,2). 
Polyadenylation helps the mRNA function efficiently in 
protein synthesis and prevents its premature degradation 
before it has performed this task. Besides being an essen- 
tial step in mRNA maturation, it is also a point at which 
the cell controls gene expression (2-4). Inappropriate 
3'-end processing can contribute to human diseases (5), 
and modulate the expression of oncogenes (6). Poly- 
adenylation factors are also targeted by some viruses to 
limit expression of host cell mRNAs and favor viral 
protein production (7-10). 

The complex that catalyzes this processing is well 
conserved from yeast to humans, although the individual 
subunits separate into somewhat different subcomplexes 
upon biochemical purification (1,2). In Sachharomyces 
cerevisiae, cleavage requires the 4-subunit Cleavage- 
polyadenylation Factor IA (CF IA), the 15-subunit 
Cleavage-Polyadenylation Factor (CPF) and Hrpl- 
Nab4, while tail synthesis requires these factors plus the 
Pabl and Nab2 poly(A)-binding proteins, which restrict 
tail length (11,12). CPF recognizes sequences around the 
poly (A) site and contains the poly(A) polymerase Papl 
and the Yshl nuclease, which are the enzymatic activities 
that act at the cleavage site, while CF I A and Hrpl help 
position CPF at the poly(A) site. These factors are also 
necessary to insure that RNA Polymerase II transcription 
terminates in a timely fashion and does not interfere with 
the function of downstream DNA elements such as pro- 
moters, centromeres and origins of replication or lead to 
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the production of anti-sense RNA and possible gene 
silencing (13). A current challenge in the area of mRNA 
3'-end processing is to improve our understanding of how 
the architecture of the cleavage-polyadenylation complex 
contributes to function and to integration of this process- 
ing machinery with other nuclear events. 

In this regard, the essential Pcf 1 1 component of CF IA 
is one of the better-studied subunits, and is required for 
both the cleavage and poly (A) addition steps. It is the only 
subunit of CF IA that contacts each of the other three 
subunits — Rnal4, Rnal5 and Clpl (13-16), and it is 
thought to be one of the scaffold proteins of the process- 
ing complex. In support of such a role, Pcf 1 1 also interacts 
with the Ptal and Ssu72 subunits of CPF (17-19) and with 
the phosphorylated C-terminal domain (CTD) of the 
Rpbl subunit of RNAP II (20,21), thus connecting the 
processing machinery to the transcription elongation 
complex. These interactions are mediated through 
a CTD interaction domain (CID) at its N-terminus, a 
central domain for Rnal4-Rnal5 interaction, and a 
C-terminal Clpl interaction domain (Figure IA). 
Mutations in the CID impair efficient termination of 
snoRNA and short mRNA transcripts, while mutation 
elsewhere affects the 3'-end processing and termination 
of long mRNAs (13-16). 

Like Pcfll, the Rnal4 and Rnal5 subunits of CF I A 
are also required for 3'-end processing and termination. 
Rnal5 recognizes the A-rich processing signal located 
upstream of the poly(A) site (22) and contact between its 
RNA Recognition Motif (RRM) and that of Hrpl at the 
U A-rich signal is induced upon RNA binding (23). The 
tight interaction of Rnal5 with Rnal4 increases its affinity 
for RNA (24,25), and an Rnal4-mediated bridge to Hrpl 
helps direct Rnal5 to the correct binding site (22). 

In contrast to other CF IA subunits, much less is known 
about the precise function of yeast Clpl other than being 
required for both cleavage and poly(A) addition (26). 
Discovered in 1997 by Minvielle-Sebastia et al. (27), it 
interacts in pull-down experiments with Cftl, Cft2, Ptal 
and Pcfll (17,19,26). Surprisingly, Clpl is the best 
characterized CF IA subunit in terms of having the most 
complete 3D structure. It contains a large central domain 
that binds ATP and is flanked by two smaller N- and 
C-terminal domains (28). Part of the central domain 
surface and a hydrophobic cleft formed by the central 
and CTD create the Pcf 1 1 -binding site. The conserved 
human Clpl, and an archeal homolog, but not yeast 
Clpl, are 5 ; -OH polynucleotide kinases, and this activity 
in humans is important in tRNA splicing (28-32). 

The goal of the work described here is to more com- 
pletely define the role of Clpl in the processing complex. 
In this study, we identified new interactions between Clpl 
and components of the processing machinery and found 
that the N- and C-terminal domains of Clpl are essential 
for cell viability. In addition, specific mutations in the 
ATP-binding site are lethal, and surprisingly, interfere 
with Pcfll interaction. Mutations in Pcfll that perturb 
Clpl binding cause temperature-sensitive growth and 
affect cleavage, polyadenylation and transcription 
termination. 



MATERIALS AND METHODS 

Yeast strains and culture 

The S. cerevisiae strains used in this study are as follows. 
Strain CM246 (M AT uMhis3 A 1 leu2Al lys2A0 ura3A0 
clph\KanMX4 [YCpLac33-CLP7]) was constructed by 
transforming a diploid strain with a chromosomal copy 
of CLP1 disrupted with KanMX4 module (dp l:\KanMX) 
with a YCpURA3-CLPl plasmid, and then sporulating to 
create the haploid strain. The PCF 11 (NA53), pcf 11-9 
(NA67) and pJ69-4A strains were described previously 
(15,33,34). To determine the effect of clpl and pcfll mu- 
tations on yeast cell growth, the plasmid shuffle comple- 
mentation assay was used (35). The pRS315 plasmids 
containing clpl mutations were analyzed in the CLP1 
plasmid shuffle strain CM246, and the pRS315 plasmids 
containing pcfll mutations were analyzed in the PCfll 
plasmid shuffle strain NA53. Growth properties were 
analyzed by growing the strains in liquid YPD at room 
temperature to an optical density at 600 nm of 1.0, 
spotting 5ul of 10-fold serial dilutions on YPD plates 
and incubating the plates for 3 days at 16, 24, 30, 37 or 
39°C. 

Saccharomyces cerevisiae CLP1 and PCF11 plasmids 
and mutants 

CLP1 plasmids. For YCpLac33-clpl, the coding sequence 
of CLP1 was amplified from genomic DNA by PCR. The 
PCR product was digested with Ndel and Xhol, and 
cloned into the YCpLac33 (URA3) vector flanked by the 
800-bp upstream and downstream of the coding region of 
the Ythl gene. pRS315-CLP7 and its truncations were 
constructed by PCR from YCpLac33-clpl and cloned by 
Nhel and Ncol into pRS315 (LEU2). The CLP1 sequence 
is flanked by a Myc epitope tag at the N-terminus and a 
V5 tag at the C-terminus and under the control of the 
MPEI promoter and terminator sequences (900-bp 
upstream and downstream of the coding region of the 
MPEI gene). Glutathione S-transferase (GST) fusion 
Clpl and its truncations were constructed by insertion of 
PCR-amplified fragments into the EcoRI and NotI of 
pGEX-6 P-2. For yeast two hybrid analyses, the CLP1 
coding sequence was cloned in pGAD-C2 using EcoRI 
and Pstl. 

Pcfll plasmids. pFL38-pcfll and pcfl 1-pGBDU-Cl were 
described previously (15,33,34,36). pRS315-pcfll was 
constructed by PCR of the coding region of PCF11 gene 
from genomic DNA and cloned by Nhel and Ncol into 
pRS315-Clpl digested with the same enzymes. 

Clpl and Pcfll site - directed mutants. Plasmids contain- 
ing site-directed amino acid substitutions in Clpl or Pcfl 1 
were constructed using the QuikChange Mutagenesis Kit 
(Stratagene). All DNA fragments generated by PCR amp- 
lification in this study were verified by DNA sequencing. 

Western blot analyses and immunoprecipitations 

The steady-state levels of specific proteins were 
determined by western blot using cells grown to 
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Figure 1. Functional analysis of Clpl domains. (A) Schematic of mutations analyzed in this study. The linear representation of Pcfll indicates the 
CID, the Q-rich domain (Q20), the regions which interact with Rnal4, Rnal5 and Clpl, the two regions containing zinc fingers (ZF), and the pos- 
itions of the R480A and W482A mutations. The Clpl representation shows the three domains determined by structural analysis (28), the N-terminal 
and C-terminal deletions, and the positions of mutations in the Pcfll interaction interface and the ATP-binding site. (B) The amino and carboxyl 
termini of Clpl are essential for cell viability. The CM246 [YCpU RA3-CLP 1] plasmid shuffle strain expressing epitope-tagged versions of wild-type 
Clpl or the indicated deletion derivatives on a L^C/2-marked plasmid were streaked on 5-FOA medium to counterselect the YCpURA3-CLPl 
plasmid. Plates were photographed following incubation for 4 days at 24°C. (C) In vivo expression of truncated forms of Clpl. Extracts prepared 
from strains expressing either full-length Clpl or the indicated deletion derivatives and grown on selective medium were analyzed by western blot 
using monoclonal antibody against the V5 tag on Clpl or the CPF subunit Ptal as a loading control. Asterisk indicates non-specific band detected by 
the V5 antibody. (D) Clpl interacts with specific CPF subunits. 35 S-labeled in vzYro-translated subunits of CPF were incubated with GST, GST-Clpl, 
GST-clpl(101^45) or GST-clpl(l-341) bound to glutathione-Sepharose beads. The bound proteins and 10% of the input were separated on a 10% 
polyacrylamide-SDS gel and detected by autoradiography. 



mid-exponential phase in liquid media and extracts 
prepared as described previously (37). Extracts (50 jig of 
protein) were resolved on an SDS-10% polyacrylamide 
gel, and proteins were transferred to Immobilon-P 



polyvinylidene difluoride membrane (Millipore). 
Monoclonal antibody against Ptal was described previ- 
ously (38). Anti-V5 antibody was obtained from 
Invitrogen, Glutathione S-transferase (GST) antibody 
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from BD Biosciences-Pharmingen, GAL4-DBD (SC-510) 
and GAL4-TA (SC-1663) antibodies from Santa Cruz 
Biotechnology. Immunoprecipitations from yeast extract 
were performed as described by Zhelkovsky et al. (39). 

Yeast two-hybrid analysis 

pGAD-C2 vector expressing the Gal4 activation domain 
fused to wild-type Clpl were transformed into the 
PJ69-4A host strain. The resulting strains were trans- 
formed with the pGBDU-C2 vector expressing the Gal4 
DNA-binding domain fused to wild-type and mutant 
Pcfll. Transformants were selected on complete medium 
lacking uracil and leucine (CM-UL). Protein-protein 
interactions were scored by the ability of cells to grow 
on complete medium lacking uracil, leucine and histidine 
(CM-ULH) as described (17). 

Recombinant proteins and in vitro protein-protein 
interaction assays 

Radiolabeled proteins used in this study were generated 
in vitro with the TNT rabbit reticulocyte lysate system 
(Promega) in the presence of [ 35 S]-methionine in a total 
volume of 50 ul. Expression and purification of recombin- 
ant GST-tagged proteins from the Escherichia coli Rosetta 
(DE3), as well as in vitro binding experiments were carried 
out as described previously (17,26). To produce recombin- 
ant CF I A, Rnal4 and Rnal5 were co-expressed from a 
pETDuet vector as described previously (24). Clpl and 
Pcfll were cloned into and then co-expressed from a 
pRSFDuet vector (Novagen). Pcfll was cloned into the 
site 1 position within the vector using BamHl and Notl 
restriction enzymes. Clpl was cloned into the second 
position using Ndel and Kpnl restriction sites. Protein 
expression in E. coli strain BL21 (DE3) pLysS was 
induced with 0.2 mM isopropyl 1-thio-P-D-galacto- 
pyranoside (IPTG) overnight at 16°C. Harvested cells 
were lysed in Tris-HCl, pH 8.0, 250 mM NaCl, 5% 
glycerol, 5mM [3ME, 5mM PMSF using the Avestin C5 
homogenizer and debris removed by centrifugation at 
30 000g. The CF IA complex was purified using a combin- 
ation of Ni-NTA, ion exchange chromatography and size 
exclusion chromatography. A detailed characterization of 
the CF I A complex will be presented elsewhere (J. 
Gordon, S. Shikov, J. Kuehner, M. Liriano, E. Lee, W. 
Stafford, C. Moore and A. Bohm, submitted for 
publication). 

In vitro and in vivo RNA analyses 

Preparation of yeast cell extracts, transcription of [oc- 32 P] 
UTP-labeled full-length GAL7-1 RNA or precleaved 
GAL7-9 RNA and processing assays have been described 
previously (17,38). 

P-Galactosidase assay 

Wild-type and mutant strains were transformed with 
various reporter vectors [pHZ18A2, no poly(A) site; 
pLIOl, ADH2 poly(A) site; pL501, GAL7 poly(A) site] 
(40). Cells were grown in synthetic medium lacking 
uracil but containing 2% galactose to an OD 60 o of 0.2 



and assayed by using the Gal-Screen System (Applied 
Biosystems). 

Chromatin immunoprecipitation 

Cross-linking, isolation of chromatin and sonication were 
performed as described previously (41). WT and mutant 
strains were grown to midlog (OD = 0.7-0.9) at 30°C, 
harvested, resuspended in prewarmed growth medium 
(30°C or 39°C), then incubated at either 30°C or 39°C 
for an additional 60min. The PCR was performed using 
2 jil of immunoprecipitated DNA or 0.4 ul of input DNA 
and the appropriate primer pairs. PMA1 primer pairs 
were described previously (42). PCR products were 
resolved in a 1.5% agarose gel and visualized by 
ethidium bromide staining using an Alphalmager 2000. 
Rpb3 occupancy of each of the indicated regions of 
PMA1 was calculated as a sample IP:input ratio and 
normalized to the sample IP:input ratio of an untrans- 
cribed region of chromosome V. Serial dilutions of chro- 
matin from samples not subject to immunoprecipitation 
were used to demonstrate that samples are analyzed in the 
linear range. 

ATP-binding assays 

The ATP resin-binding assay was based on the protocol of 
Haystead, et al. (43). y-Aminophenyl-adenosine triphos- 
phate immobilized on agarose (Jena Biosciences) was 
equilibrated in reaction buffer (40 mM Tris, pH 8.0, 
ImM MgCl 2 , 0.1% (3-mercaptoethanol, 0.1% NP40, 
150 mM NaCl). An amount of 500 ul of protein (10 uM) 
was incubated with 100 ul of resin at 4°C for 1 h. The resin 
was washed four times with 500 ul of reaction buffer and 
eluted with 500 ul of reaction buffer + 20 mM ATP. 

The ATP content of Clpl was determined using a modi- 
fication of the protocol described by Noble et al. (28). 
Equal amounts of recombinant protein were precipitated 
by addition of trichloroacetic acid (TCA) to 20% and the 
supernatant containing the released nucleotide removed 
and neutralized by addition of 175mM sodium acetate 
(pH 8.0). Extracted nucleotides were applied to a DNA 
Pac PA- 100 (Dionex) HPLC column equilibrated in 
lOmM Tris buffer, pH 8.0. The column retention time 
of the nucleotide after applying a gradient of 0-1 M 
NaCl over 30min was measured by monitoring the 
column eluent at 260 nm. Under these conditions, a 
control ATP sample eluted at 0.57 M NaCl. 

RESULTS 

Clpl interacts with CPF subunits 

The crystal structure of Clpl with bound ATP (28) sug- 
gested that the region formed by the first 100 amino acids 
of Clpl may act as a lid to cover the ATP-binding pocket 
of the central domain, while the C-terminal region (amino 
acids 341-445) contributes to Pcfll binding (Figure 1A). 
To better understand the role of the Clpl N- and 
C-termini, we tested the effect of deleting these domains 
on cell growth, using the plasmid shuffle assay. Mutants 
lacking the N- terminal domain {clpl -AN), or the 
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Figure 2. The pcfll (RW-A) mutant is thermosensitive for growth and 
defective for Clpl interaction. (A) Relative growth rates at different 
temperatures. Strains were grown in liquid YPD, and 10-fold serial 
dilutions were spotted on YPD plates and incubated for 2-3 days at 
the indicated temperatures. (B) In vivo expression of the pcfll (RW-A) 
mutant. Cells containing a chromosomal deletion of the PCF11 gene 
but expressing V5-tagged forms of either Pcf 1 1 or pcf 1 1 (RW-A) or an 
untagged version of wild-type Pcfll were grown at 30° C and then 
shifted to either 30°C or 39°C for an additional 60min. Extracts 
from these cells were analyzed by western blot using a monoclonal 
antibody against the V5 tag or Ptal. (C) Two-hybrid analysis of the 
interaction between pcfll (RW-A) and Clpl. For two-hybrid analysis in 
yeast, plasmids carrying Pcfll, pcfll (RW-A) fused to the Gal4 
DNA-binding domain (GBD), or only the GBD (none) and plasmid 
carrying Clpl fused to the Gal4 activation domain (pGAD-Clpl) were 
cotransformed into strain PJ69-4A. Ten-fold serial dilutions of the re- 
sulting strains were spotted on medium lacking uracil and leucine 
(CM-UL), as indicated. Activation of HIS3 expression was tested by 



C-terminal {clpl- AC) amino acids were not viable when 
the wild-type CLP1 plasmid was counter selected on 
5-FOA medium (Figure IB). Western blot analysis con- 
firmed that the mutant Clpl proteins were expressed, 
albeit at a reduced level for clpl -AN (Figure 1C). These 
results indicate an essential role for the N- and the 
C- termini in Clpl function. 

To further understand how Clpl works in mRNA 
3'-end formation, we next asked if Clpl interacts directly 
with any of the CPF subunits. By using a GST-Clpl 
fusion protein in pull-down assays, we detected clear inter- 
actions with CPF components Cftl, Ptal, Pfs2, Yshl and 
Ssu72, and no interaction with Ythl, Mpel, Papl, Cft2 
and Fipl (Figure ID). Using the same assay, we found the 
N- terminal region of Clpl is required for Yshl and Ssu72 
interaction and the central region is required for Cftl, 
Ptal and Pfs2 interactions (Figure ID). This analysis 
suggests that Clpl provides cross-factor connections to 
several CPF subunits. 

Weakening the Pcfll-Clpl interaction causes a 
thermosensitive growth phenotype 

Based on the structure of the Clpl-Pcfl 1 complex (28), we 
made several mutations in Clpl and Pcfll that could 
disrupt interaction of the two proteins. Three substitutions 
(S179E, S192R and R236A) were selected on the Clpl side 
of the interaction interface and introduced individually 
into the Clpl coding sequence (Figure 1A). However, 
strains carrying these mutants as the only forms of Clpl 
grew as well as wild-type on rich medium at 16, 23, 30, 37 
and 39°C. If indeed the Clpl-Pcfl 1 interaction is import- 
ant for function of the processing complex, contacts with 
Pcf 1 1 through other amino acids that were not mutated 
may be sufficient to maintain the interaction. 

There are two conserved residues in Pcfll (R480 and 
W482) that are extensively involved in hydrogen bonding 
and hydrophobic interactions at the protein-protein inter- 
face (28). Both side chains project into pockets formed on 
the surface of Clpl. Site-directed mutagenesis was used to 
replace both of these residues with alanine. This mutant 
strain, pcfll (RW-A), grew like wild-type at 16, 24 and 
30°C, but exhibited reduced growth at 37°C and no 
growth at 39° C (Figure 2A). Western blot analysis con- 
firmed that the growth phenotypes were not due to in- 
stability of the mutant pcfll protein (Figure 2B). 

To more directly examine the cause of the thermo- 
sensitive growth, we tested the ability of pcfll (RW-A) 



spotting the same strains on medium lacking uracil, leucine and histi- 
dine (CM-ULH), and incubated for 2-3 days at 30°C. (D) Expression 
of GBD-Pcfll and GAD-Clpl analyzed by western blot. Extracts 
prepared from the different two-hybrid strains grown at 30° C were 
analyzed by western blot using a monoclonal antibody recognizing 
either the GAL4 activation or DNA-binding domain. (E) Pulldowns 
of Pcfll/Clpl complexes in yeast extract. Extract was prepared from 
cells expressing V5-tagged forms of either Pcfll or pcfll (RW-A) and 
grown at 30° C. Immunoprecipitations from each extract were con- 
ducted using antibody against the V5 epitope, and blots of the 
precipitated proteins (IP) probed with Clpl or Pcfll antibody. The 
amount of Clpl in the different extracts is shown in the panel 
marked 'Input'. 
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to interact with Clpl by yeast two-hybrid analysis. 
Plasmids carrying the GAL4 DNA-binding domain fused 
to PCF11 (pGBDU-Pcfll) and plasmids carrying the 
GAL4 activation domain fused to CLP1 (pGAD-Clpl) 
were cotransformed into the two-hybrid tester strain 
PJ69-4A, and growth was examined at 30°C. All 
transformants grew well on media that selected for both 
plasmids, indicating that none of the fusion proteins 
were dominant-negative (Figure 2C). In addition, the 
GAD-Clpl and GBDU-Pcfll fusions could be detected 
by western blot analysis using antibodies specific to 
GAL4 binding and activation domains (Figure 2D). 
Positive two-hybrid interactions activate transcription of 
a HIS3 reporter gene and allow growth on medium 
lacking histidine (CM-ULH). pGAD-Clpl did not 
promote growth in the presence of pGBDU vector 
alone. When pGAD-Clpl was tested for interaction with 
the different pGBDU-Pcf 1 1 constructs, we found that in 
contrast to the wild-type Pcfll, pcfll(RW-A) did not 
allow growth in the absence of histidine. These results 
indicate that Pcfll amino acids R480 and W482 are 
needed for Clpl interaction. The loss of a two-hybrid 
interaction at 30° C but viability of the pcfll mutant at 
the same temperature suggests that other interactions 
within the processing complex can compensate for a 
weakened Pcfll -Clpl interaction at the lower tempera- 
ture. It is also possible that the GBD and GAD fusions 
to Clpl and Pcfl 1 interfere with such compensatory inter- 
actions, thus heightening an interaction defect when 
measured by two-hybrid analysis. 

To further confirm the weakened interaction between 
Clpl and the pcfll(RW-A) mutant protein, we performed 
pull-down assays from yeast extracts. The amount of Clpl 
found in extract is not affected by the pcfll mutation 
(Figure 2E). However, significantly less Clpl was 
observed in complex with pcfll(RW-A) compared to 
that associated with wild-type Pcfl 1 . 



Disruption of the Pcfll-Clpl interaction causes defects in 
mRNA 3 -end processing and transcription termination 

To more directly examine the cause of the thermosensitive 
growth phenotype, we tested the functionality of the 
pcfll (RW- A) allele using in vitro and in vivo assays for 
3'-end formation and transcription termination. Extracts 
prepared from cells expressing pcfll(RW-A) grown at 
30° C showed very little processing activity in a coupled 
cleavage-polyadenylation assay when compared to extract 
isolated from a wild-type strain (Figure 3 A, left panel). 
There was no accumulation of cleaved, unadenylated 
product, indicating that the cleavage step was defective. 
The mutant extract also exhibited much reduced activity 
when poly(A) addition was assayed on a precleaved sub- 
strate (Figure 3 A, right panel). Both cleavage and 
poly(A) addition in the pcfll (RW- A) extract could be 
recovered by the addition of a recombinant CF IA 
complex containing Pcfll, Clpl, Rnal4 and Rnal5 
(Figure 3B), indicating that the function of the CPF 
and Hrpl processing factors were not affected by the 
mutation in Pcfll. 

To test whether the pcfll (RW- A) mutation affected 
processing and transcription termination in vivo, we used 
a previously described system in which a poly(A) site is 
inserted into an intron upstream of the lacZ gene (40). 
Cells expressing wild-type Pcfll or pcfll(RW-A) were 
transformed with reporter plasmids that had either the 
3'-end of the ADH2 or GAL7 genes inserted into the 
intron, or as a control, one with no insertion, and levels 
of (3-galactosidase activity determined. As expected, in 
wild-type cells, the presence of either poly(A) site 
blocked read-through transcription into the lacZ gene 
(Figure 4A). The pcfll (RW-A) strain showed an 
increase of 3-fold in the level of read-through transcription 
compared to the wild-type for the ADH2 polyadenylation 
signal and an even greater increase (20 x) for the GAL7 




Coupled Poly(A) addition Coupled 

reaction reaction reaction 



Figure 3. Mutations in the Clpl interaction interface of Pcfll cause defects in mRNA 3'-end formation in vitro. (A) In vitro 3'-end processing assay. 
For coupled cleavage-polyadenylation assays (left side), extracts prepared from the strains containing the wild-type (PCF11) or the pcfll (RW-A) 
mutant grown at 30°C were incubated with ATP and 32 P-labeled full-length GAL7-1 RNA (Pre, unreacted precursor) for 20min at 30°C. The same 
conditions were used for poly(A) addition assays (right side) except that the precleaved GAL7-9 RNA was used as precursor. Products were resolved 
on a denaturing 5% polyacrylamide gel and visualized with a Phosphorlmager. Positions of substrate and product are indicated on the left and right. 
(B) The processing defect of pcfll (RW-A) is rescued by addition of CF IA. Coupled cleavage-polyadenylation assays were performed as described in 
Panel A using the indicated extracts. Recombinant CF I A containing wild-type Clpl, Pcfll, Rnal4 and Rnal5 was added to pcfll ( RW-A) extract 
before initiating the reaction (right-most lane). 
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Figure 4. Mutations in the Clpl interaction interface of Pcfll cause 
defects in mRNA 3' -end formation and transcription termination 
in vivo. (A) In vivo 3'-end processing and termination assay. Quantitative 
measurements of |3-galactosidase activity were performed using wild-type 
or mutant strains transformed with reporter vectors lacking or contain- 
ing an efficient polyadenylation signal upstream of the lacZ gene and 
grown at 30°C. The amount of read-through is expressed as a percentage 
of that obtained from vector lacking the poly(A) site. The data is the 
average of 48 independent assays for each strain. The numbers under the 
graph indicate the fold increase in read-through transcription over 
the wild-type value for each poly(A) site. (B) Schematic depiction of 
the PMA1 gene showing the position of its promoter (black box) and 
two polyadenylation sites (vertical arrows). The regions probed by ChIP 
are denoted 1-9. (C) Chromatin immunoprecipitation analysis of Rpb3 
crosslinking to PMA1. Cells are grown at 30° C and then shifted to either 
30°C or 39°C for an additional 60min. before formaldehyde 
crosslinking. Lane C denotes a non-transcribed region of chromosome 
V that serves as an internal background control. All PCR primer pairs 
used in this analysis are identical to those described previously (41). 
The input signal represents DNA prior to immunoprecipitation. 
(D) Quantification of the data shown in panel B. Rpb3 occupancy of 
each of the indicated regions of PMA1 was calculated as a sample 
IP:input ratio, normalized to the IP:input ratio of the untranscribed 
region of chromosome V (lane C in RPB3 and INPUT panels) and 
plotted as fold increase over background (ratio = 1.0). 



poly (A) site (Figure 4A). For comparison, we used the 
pcfll -9 allele, which has been previously shown to have 
cleavage and transcription termination defects (14,15). 
This mutation caused a greater read-through compared 
to the pcfll (RW- A) mutation, with a 6x increase for 
ADH2 and 68 x for GAL7 (Figure 4A). It is not known 
why termination driven by the GAL7 poly (A) site is more 
sensitive to mutations in PCF11. 

These results suggest that the defect in 3'-end processing 
observed with the pcf 1 1 (RW-A) mutant is also causing a 
defect in termination. This conclusion was confirmed by 
determining the chromatin immunoprecipitation (ChIP) 
profile of RNA Polymerase II (RNAP II) across the 
PMA1 gene (Figure 4B-D). In the pcfll mutant strain 
but not the wild-type, crosslinking of the RNAP II 
subunit Rpb3 extended into position 9, which is located 
downstream of the distal PMA1 poly(A) site. Together, 
these analyses indicate that the Pcfll-Clpl interaction is 
important for efficient 3'-end processing and termination 
of mRNA transcripts. 

Specific alterations in the Clpl-ATP-binding site give a 
non-functional protein 

The central domain of yeast and human Clpl possesses a 
canonical P-loop (GxxxxGKT/S) which spans G130 to 
T137 of yeast Clpll, along with the switch regions char- 
acteristic of the SIMIBI class of ATP/GTPases (44). To 
probe the function of this region of Clpl, site-directed 
mutagenesis was employed to introduce single or double 
amino acid substitutions at residues in yeast Clpl that 
should affect the ATP binding either at the nitrogenous 
base, the ribose or the phosphate (D33A, K72A, the 
double mutant D33A-K72A and K136A) based on the 
published structure (28). All of the mutants were viable 
in the absence of wild-type Clpl, and when tested on rich 
(YPD) medium at 16, 23, 30, 37 and 39°C, all of these clpl 
mutant strains exhibited growth that was indistinguishable 
from the corresponding wild-type strain. This finding is 
consistent with other studies showing that the amino 
acid substitutions G135R, K136A-T137A and D161A in 
yeast Clpl did not affect viability (29,31). This result was 
surprising given the conservation of these residues and 
previous reports that similar active site substitutions in 
known ATPases abolish ATP hydrolase activity (45,46). 
Similar replacements also interfere with the ability of T4 
polynucleotide kinase, tRNA ligases, the Grc3 RNA 
kinase recently implicated in termination of transcription 
by RNA polymerase I, and hClpl homologs to phosphor- 
ylate RNA (30-32,47). To more rigorously test the 
possible importance of ATP binding, we created three 
additional substitutions in Clpl (K136E, T137W and the 
double mutant K136E/T137W). These mutations, by 
substituting a positively charged amino acid for a nega- 
tively charged one (K136E) and a small amino acid for a 
bulky one (T137W) in the Walker A motif, were designed 
to be more disruptive to the ATP-binding site than alanine 
substitutions of the same residues. Unlike the previous 
mutations, none of these could support cell growth in 
the absence of wild-type Clpl on 5-FOA medium 
(Figure 5A). Western blot confirmed that the observed 



Nucleic Acids Research, 2012, Vol. 40, No. 3 1221 



A 

CLP1 

K136E 

T137W 

K136E, 
T137W 




FOA Plate 



Q 
O 



B 



o ^ 

2 d 



m 

CO 



K 
CO 



CO CO 
J- 





a-V5 



a-Pta1 



16.33 22.86 
time (minutes) 



Figure 5. Analysis of mutations in the ATP-binding site of Clpl. (A) Mutations predicted to displace ATP cause inviability. The XH5 
[YCpURA3-CLPl] plasmid shuffle strains expressing V5-tagged versions of wild-type Clpl or the indicated mutants (K136E, T137W and K136E/ 
T137W) were grown in selective medium, and 10-fold serial dilutions were spotted on 5-FOA plates and incubated for 4 days at 24° C. (B) In vivo 
expression of clpl mutants. Extracts from strains expressing either wild-type Clpl or the indicated mutants were analyzed by western blot using 
monoclonal antibody against the V5 tag or the CPF subunit Ptal as a loading control. (C) Wild-type Clpl and the clpl (K136E,T137W) mutant 
proteins contain equivalent amounts of ATP. Equal amounts of wild-type and mutant recombinant proteins were TCA precipitated and the 
supernatant applied to a DNA Pac PA-100 (Dionex) HPLC column. Elution profiles were recorded at 260 nm for ATP alone and for ATP extracted 
from the acid-precipitated Clpl mutant (red) and wild-type protein (green). ATP (blue) eluted at 0.57 M NaCl on a gradient of 0-1 M NaCl over 
30min. For clarity, the three traces were displaced with respect to one another on the F-axis. 



growth phenotypes were not due to instability of the 
mutant proteins, as the steady-state protein levels were 
comparable to wild-type (Figure 5B). In addition, 
circular dichroism analysis indicated that the K136E/ 
T137W mutant had similar secondary structure as 
wild-type Clpl (data not shown). The inviability indicates 
that the K136E and T137W substitutions in Clpl, which 
are predicted to displace ATP, interfere with one or more 
important functions of S. cerevisiae Clpl in vivo. 

ATP-binding assays were carried out to test whether 
the Clpl double mutant (K136E, T137W) impaired ATP 
binding. When applied to a y-ATP-Agarose resin column, 
both the wild-type and mutant proteins bound poorly, 
suggesting that exchange with free ATP might be very 
slow (data not shown). As an alternate approach, the 
amount of ATP bound to wild-type and mutant protein 
was assessed by reverse phase high-performance liquid 
chromatography (HPLC) analysis using recombinant 
protein, as described by Noble et al. (28). Similar 
amounts of ATP were extracted from the wild-type and 
mutant proteins (Figure 5C). This result shows that the 
ATP-binding capacity of the K136E, T137W protein is 
intact and that the inviability of the mutant is probably 
not due to loss of ATP binding. 

To determine what might be defective about a clpl 
mutant with K136E and T137W replacements, we 
examined the interaction with Pcfll by two-hybrid 
analysis (Figure 6A). The single mutations caused a 



noticeably weaker interaction with Pcfll, and the inter- 
action with the double mutant was significantly worse. To 
confirm the loss of interaction observed by two-hybrid 
analysis, we performed co-immunoprecipitations from 
yeast extracts. As expected, the clpl (K136E, T137W) 
mutant protein is defective for Pcf 1 1 interaction in com- 
parison to wt Clpl (Figure 6B). These results suggest that 
how ATP sits in its Clpl -binding site affects specific inter- 
actions on the Clpl surface. 

DISCUSSION 

The polyadenylation of mRNA 3'-ends in eukaryotes 
requires a surprisingly large complex, which in yeast is 
composed of four separate, but interacting factors, 
namely, CF IA, CPF, Hrpl and Pabl (1,2) (Figure 7A). 
All but two of the 21 individual subunits are essential. A 
challenge in the field has been to assign functions to these 
numerous proteins and to understand how the arrange- 
ment of proteins within the multisubunit CPF and CF 
IA contributes to the activity of each factor. Several 
subunits have enzymatic activities, such as the nuclease 
Yshl, the poly(A) polymerase Papl, and two phosphat- 
ases (Ssu72 and Glc7). Some subunits contact processing 
signals on the RNA precursor, while others, such as Ptal, 
Cftl, Pcfll and Rnal4, are thought to have scaffold func- 
tions in organizing the complex and in physically coupling 
processing to transcription and export (17,22,48-50). In 
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Figure 6. Clpl proteins with K136E and T137W replacements are de- 
fective for Pcf 1 1 interaction. (A) Two-hybrid analysis of the interaction 
between Pcf 11 and different forms of Clpl. Two-hybrid analysis was 
performed as described in Figure 2C. (B) Pulldowns of Pcfll-Clpl 
complexes in yeast extract. Extract was prepared from cells expressing 
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this study, we further our understanding of the role of the 
Clpl subunit of CF IA in the organization and activity of 
the eukaryotic polyadenylation complex and discuss the 
implications of our findings below. 

Cross-factor contacts 

Processing of mRNA 3'-ends is initiated when the factors 
that comprise the cleavage-polyadenylation complex rec- 
ognize the constellation of signal sequences that surround 
the poly (A) site. Cleavage removes at least one of these 
signals, yet the complex must remain in stable association 
with the RNA so that poly(A) addition is efficient. Most 
of the yeast and mammalian factors do not have strong 
affinity for their cognate sequences, and assembly of the 
active complex is facilitated by cross-factor interactions. 
The possible proteins mediating inter-factor contacts have 
been most extensively investigated in the yeast system 
(Figure 7A). For example, the Rnal4 subunit of CF I A 
can bridge to Hrpl and also to CPF through association 
with Fipl, Pfs2 and Cftl (22,48,51,52). Pcfl 1 can associate 
with Cftl, Cft2, Yshl and Ptal (17,19) and contacts 
between Rnal5 and Hrpl are induced when both 
proteins bind RNA (23). It is important to note that 
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Figure 7. Regions and interactions of the yeast Clpl. (A) Summary of 
physical interactions known to occur between subunits of CF IA and 
other factors, indicated by connecting lines (Clpl interactions, dashed 
line; Pcfll interactions, solid black line; Rnal4 interactions, dotted line; 
Rnal5 interactions, gray line). These interactions include CPF subunits 
(19,51,52), and this study), Hrpl (26), Pabl (11), the CTD of the Rpbl 
subunit of RNA Polymerase II (RNAP II) (20,21), and Yral of the 
mRNA export machinery (49). CPF subunits are indicated as rect- 
angles; interactions of CF IA with CPF subunits in the right column 
have not been demonstrated. (B) Schematic diagram of Clpl 
summarizing the regions needed for interaction with different CPF 
subunits. Pcfll interacts with amino acids primarily in the central 
domain of Clpl but partly in the CTD (28). (C) Model of the Clpl 
ATP-binding site, based on the published structure (28). The locations 
of K136 and T137 are indicated. Parts of the surface composed of 
carbon are colored green, oxygen is red, and nitrogen is blue. 



evidence for these interactions comes from in vitro 
assays using individual subunits, and none of the 
contacts have been confirmed to occur in an assembled 
processing complex. 

In this study, we show that Clpl can directly interact 
with the Pfs2 and Ssu72 subunits of CPF, in addition 
to the previously reported contacts with Cftl, Ptal and 
Yshl (17,19), and does so through different domains 
(Figure 7B). However, Clpl does not interact with Ythl, 
Mpel, Papl, Cft2 and Fipl. hClpl, a subunit of the mam- 
malian CF II m factor, interacts with CF I m and CPSF 
(53), and the Arabidopsis counterpart also interacts with 
CPSF (54,55). CPSF contains homologs of Cftl, Cft2, 
Fipl, Yshl and Ythl. Together, these findings support 
the idea that one of the conserved functions of Clpl is 



Nucleic Acids Research, 2012, Vol. 40, No. 3 1223 



to bridge different subcomplexes of the processing ma- 
chinery. Interestingly, hClpl cannot replace yeast Clpl 
in vivo and unlike the yeast Clpl, is only needed for the 
cleavage step and not for poly(A) addition (31,53), sug- 
gesting important differences may exist in the protein 
interaction networks of these two homologs. 

Consistent with the idea that the processing complex 
assembles on precursor RNA and then disassembles into 
individual factors once poly(A) addition is complete, none 
of the interactions of Clpl with CPF subunits are as 
strong as its interaction with Pcfll, a contact conserved 
in the mammalian apparatus. The Pcfl 1 -binding site is 
located in a groove that is partly on the surface of the 
central domain of Clpl and partly at the interface of the 
central domain and the CTD (28). Our site-directed mu- 
tagenesis (Figure 2C) shows that this conserved inter- 
action interface is indeed important for association of 
these two CF IA subunits, and that disruption of this 
contact interferes with CF IA function in mRNA 3'-end 
processing and transcription termination downstream of 
poly (A) sites (Figures 3 and 4). These findings are com- 
plementary to a concurrent study by Haddad et al. (R. 
Haddad, F. Maurice, N. Viphakone, F. Voisinet-Hakil, 
S. Fribourg and L. Minvielle-Sebastia, in press), who 
showed that mutations in Clpl that disrupt Pcfll inter- 
action are similarly defective for cleavage and poly(A) 
addition. Moreover, they found that depletion of Clpl is 
detrimental to sn/snoRNA termination. This loss of Clpl 
did not perturb the levels of Pcfll, Rnal4 and Rnal5, 
suggesting that Clpl, like the other CF IA subunits, also 
has an important role in termination of non-adenylated 
RNAP II transcripts. 

Function of the ATP-binding site of yeast Clpl 

Clpl has been an intriguing component of the 3'-end pro- 
cessing complex because it contains a Walker A motif 
characteristic of the P-loop found in ATP or 
GTP-binding proteins, but no other motifs suggestive of 
function. Determination of the structure of Clpl showed 
that it does indeed bind ATP (28), and the mammalian 
homolog was found to provide RNA kinase activity 
needed for one of the tRNA splicing pathways and for 
activating synthetic siRNAs for incorporation into 
silencing complexes (31,32). The alternate mechanism of 
tRNA exon ligation thought to use hClpl does not occur 
in S. cerevisiae, and mutations in the ATP pocket that 
should destroy this activity had no effect on Clpl 
function in yeast (31) and this study). Noble et al. (28) 
found that ATP hydrolysis by Clpl was not activated by 
the other CF IA subunits, and proposed that the contacts 
of Clpl with CPF might trigger nucleotide hydrolysis as 
well as exposure of the ATP-binding site to allow for ADP 
dissociation and ATP rebinding. Still, one would expect 
that the mutations in conserved P-loop residues would 
have growth defects if enzymatic activity was important 
for the essential process of mRNA polyadenylation. 

In this study, we discovered that more disruptive muta- 
tions (K136E, T137W) in the P-loop of Clpl were lethal, 
but did not result in loss of ATP binding (Figure 5). In an 
effort to explain this surprising result, we modeled the 3D 



structure of the K136E, T137W mutant. The modeling 
confirmed that the mutated residues should interfere 
with the ATP conformation observed in the crystal struc- 
ture. We also noticed that the ATP-binding site in 
wild-type Clpl is significantly larger than necessary 
(Figure 7C). It is larger on both base and gamma phos- 
phate ends of the nucleotide, and has an additional empty 
space extending away from the ribose. In addition, the 
phosphate-binding pocket appears both looser and less 
basic than is common. Surprisingly, the extra space 
around the phosphates is not filled with water molecules. 
Whereas yeast Clpl appears to be a catalytically dead 
molecule, the mammalian homolog is an active kinase 
that phosphorylates the 5'-OH of RNA (31,32). Thus 
the extra space near the ATP-binding site may be a 
vestige of the RNA-binding site that is likely present in 
active homologs of yeast Clpl. However, with only 24% 
identity between the yeast and human proteins, further 
meaningful comparison will require a structure of hClpl. 
We have also not detected any interaction of Clpl and 
RNA by UV crosslinking (22) or by non-denaturing gel 
electrophoresis (data not shown). 

The ATP-binding cavity of Clpl most closely resembles 
that of two characterized ATPases: the NifH Fe subunit of 
the Azobabactr vinelandii nitrogenase and the E. coli de- 
toxification pump protein ArsA (28). ATP hydrolysis by 
these enzymes transmits conformational changes in the 
macromolecular complexes in which NifH and ArsA 
reside. Our analysis of the K136E, T137W mutant 
suggests that Clpl may function in a similar way, but 
one that does not require catalysis. Our data show that 
the contact with Pcfll was dramatically weakened 
(Figure 6), even though the ATP-binding site is well 
removed from the binding site of Pcfll (28). This 
mutation does not abrogate nucleotide binding 
(Figure 5), suggesting a specific change in the conform- 
ation of the protein rather than gross misfolding of the 
central domain. Interestingly, Haddad et al. (29) have 
shown that nonlethal mutations in the P-loop domain of 
Clpl will impair 3'-end processing if they are paired with a 
mutation in the Pcfl 1 -binding region. This combination of 
mutations not only disrupts the binding of Pcfll to Clpl, 
and but also its binding to the Rnal4-Rnal5 subcomplex, 
suggesting that the Clpl -Pcfll interaction is modulated 
by how Clpl binds ATP, which in turn causes a conform- 
ational change in Pcfl 1 that promotes its interaction with 
Rnal4-Rnal5. 

One interpretation of these observations is that the 
ATP-binding site of Clpl functions as a molecular 
switch in which the mode of ATP binding induces con- 
formational changes necessary for processing activity. A 
repositioning of the ATP mimicked by the K136E, T137W 
mutations could occur naturally by opening of the lid 
formed by the N-terminal domain, which on its inner 
surface makes contact with the adenine base and 2! OH 
of the ribose (28). Ssu72 and the endonuclease Yshl, two 
subunits of CPF essential for the cleavage step, interact 
with the N-terminal domain and could mediate such a 
repositioning, which in turn would loosen the interaction 
with Pcfl 1 . It is not clear how these changes might help 
the complex move through a cycle of processing, but our 
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findings with the pcfll(RW-A) mutant (Figures 3 and 4) 
show that weakening the association of Clpl and Pcfll 
affects cleavage, poly(A) addition and termination down- 
stream of poly(A) sites. In the lethal K136E, T137W 
mutant, we would predict that processing would be 
blocked at some point because the reorientation of the 
ATP is not reversible. 

It is also interesting to note that Pcf 1 1 is not an obliga- 
tory partner for Clpl. hPcfll, while binding Clpl in the 
mRNA cleavage factor CF II m , is not found with Clpl in 
the human tRNA splicing endonuclease complex (32,53). 
Furthermore, the Clpl interaction region on Pcfll 
overlaps with that of the mRNA export factor Yral 
(49), and it has been proposed that Clpl exchanges with 
Yral in a transition that leads to 3'-end processing and 
mRNP maturation when the transcription elongation 
complex reaches the end of a gene. While it is conceivable 
that the two proteins can bind Pcf 1 1 simultaneously, the 
ATP binding of Clpl could also have a role in switching 
Pcfll partners. Consistent with this possibility, the Clpl- 
Pcfll interaction is important in recruiting Rnal5 to 
the RNAP II elongation complex (29). 

In summary, our findings support a model in which 
Clpl has a dynamic structural role in the assembly of 
the mRNA cleavage-polyadenylation complex on the 
poly(A) site and subsequent processing at that site. 
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